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Abstract 

We critically analyze the problem of formulating duality between fringe visibility and 
which-way information, in multibeam interference experiments. We show that the tradi- 
tional notion of visibility is incompatible with any intuitive idea of complementarity, but 
for the two-beam case. We derive a number of new inequalities, not present in the two- 
beam case, one of them coinciding with a recently proposed multibeam generalization of 
the inequality found by Greenberger and YaSin. We show, by an explicit procedure of 
optimization in a three-beam case, that suggested generalizations of Englert's inequality, 
do not convey, differently from the two-beam case, the idea of complementarity, according 
to which an increase of visibility is at the cost of a loss in path information, and viceversa. 



1 Introduction 



Interferometric duality, as complementarity between fringe visibility and which-way informa- 
tion is called today, has a long, perhaps a surprisingly long history (for a recent review, see j^). 
It was the central issue of the famous debate between Einstein and Bohr, on complementarity. 
Even if, already at that time, in defending complementarity against Einstein's criticism, Bohr 
pointed out that not only the system under observation, but also the measuring apparatus 
should be regarded as a quantum object |2], the discussion was essentially semiclassical in na- 
ture. As it was based essentially on the position-momentum Heisenberg uncertainty principle, 
it considered only the two extreme cases, of either a purely particle- like or a purely wave-like 
behavior of the system. It was only in 1979 that Wootters and Zurek, j^j gave the first full 
quantum mechanical treatment of Young interference, in the presence of a which-way detector. 
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They recognized that "in Einstein's version of the double-sht experiment, one can retain a 
surprisingly strong interference pattern by not insisting on a 100% rehable determination of 
the sht through which each photon passes" . 

By now, a consistent and simple formulation of interferometric duality has been achieved 
in the case of two interfering beams. In the absence of a which-way detector, Greenberger 
and YaSin |^ , showed that it was possible to convert the basic quantum mechanical inequality 
Trp^ < 1, into one connecting the fringe visibility to the predictability of the path, based on 
unequal beam populations. This is an experimentally testable inequality, as it involves physi- 
cally measurable quantities. For pure states, when the inequality is saturated, this statement 
becomes a formulation of interferometric duality; any increase in predictability is at the cost 
of a decrease in visibility, and vice versa. 

In the case of an interference experiment performed in the presence of a which-way de- 
tector, in order to gain information on the path, one needs to carry out a measurement on 
the detector, after the passage of each quanton. Since, in general, no measurement ensures 
an unambiguous path reconstruction, the determination of the best possible measurement is a 
matter of statistical decision theory, that requires an a priori choice of an evaluation criterion. 
In their pioneering work, Wootters and Zurek, 0, used Shannon's definition of information 
entropy [3] in order to evaluate the which-way information gained after the measurement. Fol- 
lowing this suggestion, Englert by using a different criterion for evaluating the available 
information, was able to establish an inequality, stating that the sum of the square of the 
distinguishability, that gives a quantitative estimate of the way*, and the visibility squared, is 
bound by one. Again, the inequality is saturated for pure states, turning into a statement of 
interferometric duality: any gain in distinguishability is paid by a loss in visibility and vice 
versa. 

In the present paper we discuss the issue of formulating interferometric duality, in the case of 

multibeam experiments. As an example of the problems arising, we may refer to an experiment 

[7] with four beams, in which the surprising result is found that scattering of a photon by one 

of the beams, may lead to an increase of visibility, rather than to an attenuation. (For a 

comment see [S|). To get a better understanding of this experiment, we build an analytical 

three-beam example, which shows that, differently from the two-beam case, the traditional 

visibility may increase, after an interaction of the beams with another quantum system. This 

points towards the need for a different notion of visibility, and one possibility is offered in 

where the visibility is defined as the properly normalized, rms deviation of the fringes intensity 

from its mean value. We briefly review Diirr's |H] derivation, for the multibeam case, of an 

inequality similar to the one of Greenberger and YaSin [1], that relates this new notion of 

visibility, to a corresponding newly defined predictability. Again, in the case of pure state the 

inequality is saturated and, then, in analogy with the two beam case, may be taken as a formal 

definition of interferometric duality. However, as we will discuss later, this is at the cost of 

*In Ref. |S] the distinguishability is expressed in terms of the optimum likelihood Copt for "guessing the way 
right" . This optimum likelihood is one minus the optimum average Bayes cost Copt 
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using a definition of predictability tliat lias some liow lost contact with the ability of guessing 
the way right. Furthermore we show how, in the multibeam case, it is possible to construct new 
inequalities, resulting, like the one of Greenberger and YaSin, from basic quantum mechanical 
properties of the density matrix. Each of them can be written in terms of quantities that, 
in principle, may be measured in interference experiments, such as higher momenta of fringes 
intensity. The new inequalities then provide, exactly as the original one, independent tests 
on the validity of quantum mechanics in multibeam interference experiments. They also are 
saturated for pure states, but, at least at first sight, they do not seem to convey any simple 
relation with the principle of complementarity. 

Then we turn to the more interesting problem of complementarity in the presence of a 
which- way detector. By introducing two alternative definitions of distinguishability, Diirr 
constructed a generalization of Englert's inequality to the multibeam case, proposing to look 
at it as a formal definition of interferometric duality. We show that, apart from the two beam 
case, the new inequality holds as an equality only for the extreme cases where either the 
visibility or the distinguishability vanishes, even when the beams and the detector are both 
prepared in pure states. Then, there may be cases in which the distinguishability and the 
visibility both increase or decrease at the same time. This is in sharp contrast with the idea of 
complementarity, according to which "...the more clearly we wish to observe the wave nature 
...the more information we must give up about... particle properties" In a recent paper 
jlUj . an example in which this situation occurs was constructed. However, we considered there 
an extremely simplified model for the detector, having a two-dimensional space of states. A 
realistic model requires an infinite Hilbert space of states, and we analyze it in this paper. 
This is a much harder problem, because the task of determining the path distinguishability 
implies the solution of an optimization problem, that has to be performed now in an infinite 
dimensional space. We report the full proof in this paper, not only for the sake of completeness, 
but also because it provides an example in quantum decision theory, which is a subject where 
few general results are known, and few cases can be actually treated. Surprisingly, in the case 
we examined, the distinguishability of the infinite-dimensional problem coincides with the one 
found in ^0] , for the simplified model. This shows that the conclusions drawn in ^U] have full 
generality, showing that the notion of interferometric duality in the multibeam case has not 
been yet properly formulated. 

The paper is organized as follows: in Sec. II we discuss interferometric-duality schemes, 
not involving which way detectors. In Sec. Ill we derive a new set of inequalities, not present 
in the two-beam case, and we comment on them. In Sec. IV, which-way detection schemes are 
treated, while in Sec. V, we discuss the optimization problem for a three-beam example. Sec. 
VI is devoted to our concluding remarks. 
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2 Visibility and Predictability. 



We consider an n-beam interferometer, in which a beam sphtter sphts first a beam of quantum 
objects ("quantons" , in brief) into n beams, that afterwards converge on a second beam sphtter, 
where they interfere, giving rise to n output beams. We imagine that, at some instant of time, 
the (normahzed) wave-functions > i = 1, . . . ,n for the individual beams are fully localized 
in the region between the two beam-splitters, and are spatially well separated from each other, 
so that < >= Sij. The state of the quanton, in front of the second beam-splitter, is then 

described by a density matrix p of the form: 

P = IIPij l^i >< V'jl . (2.1) 

The diagonal elements pu represent the populations Q of the beams, and obviously they satisfy 
the condition: 

= TVp = l. (2.2) 

i 

The off-diagonal elements of p, that we shall denote instead related to the 

probability / of finding a quanton in one of the n output beams, according to the following 
equation: 

<^^)/,,j . (2.3) 

Here, (pi — (pj is the relative phase between beams i and j. In this paper we consider exper- 
imental settings, such that all these relative phases can be adjustable at will. However, this 
is not the case in a number of experimental settings, where the features of the apparatus may 
lead to relations among the relative phases of the beams. When this happens, the output 
beam intensity Ea. H2.3|) may be rewritten, by expressing the relative phases in terms of the 
independently adjustable ones. An analysis of complementarity tailored on specific experimen- 
tal settings, involving definite relations among the phases, may turn out to be interesting and 
useful. However, the purpose of the present paper is to study the problems arising when the 
full freedom allowed by an n-beam setting is taken into account. 

Going back to Ea. H2.3|) . one notices that / does not depend at all on the populations Ci- 
In the standard case of an interferometer with two beams of interfering quantons, a typical 
measure of the fringe contrast is the traditional visibility V, defined as: 

y _ -^max ~ -^min ^2 ^'j 

where /max and /min are, respectively, the maximum and minimum of /. It is easy to verify, 
using Ea. (|2.3() with n = 2, that 

V = 2|/i2|. (2.5) 
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A few years ago, Greenberger and YaSin |i4| noticed that the general rules of Quantum Me- 
chanics imply the existence of a simple relation connecting the visibility V, to the populations 
Q of the beams. They considered the so-called predictability 

r := ICi - C2I , (2.6) 

which can be interpreted as the a-priori probability for " guessing the way right" , when one has 
unequal populations of the beams. It is easy to verify that the general condition 

Trp2 < 1 , (2.7) 

turns into the following inequality 

V2 + p2 < _ ^2.8) 

When it is saturated, namely for pure states, one can recognize in Ea. (|2.8|) a statement of 
wave-particle duality, because then a large predictability of the way followed by the quantons, 
implies a small visibility of the interference fringes, and viceversa. 

Independently on any interpretation, the inequality 1)2. 8|) represents a testable relation 
between measurable quantities, that follows from the first principles of Quantum Mechan- 
ics. Indeed, the experiments with asymmetric beams of neutrons made by Ranch et al. jllj 
are compatible with it. It is interesting to observe that Ea. (|2.8|l provides also an operative, 
quantitative way to determine how far the beam is from being pure. 

One may ask whether an inequality analogous to Ea. (|2.8jl holds in the multibeam case. 
Here, one's first attitude would be to keep the definition of visibility, Ea. (|2.5|) . unaltered. 
However, this choice has a severe fault, as we now explain. Suppose that the beams are made 
interact with another system, that we call environment, and assume that the interaction does 
not alter the populations of the beams. If the interaction is described as a scattering process, 
its effect is to give rise to an entanglement of the beams with the environment, such that: 

Ixo >< Xo\^ Pb&e = ^Pij \Xi>< Xj\ ^ \^i>< i^jl ■ (2.9) 

Here, \xo > and \xi > are normalized environments' states (we have assumed for simplicity 
that the initial state |xo > of the environment is pure, but taking a mixture would not change 
the result). The entanglement with the environment alters the probability of finding a quanton 
in the chosen output beam. Indeed, the state p' of the beams, after the interaction with the 
environment, is obtained by tracing out the environment's degree of freedom from Ea. ()2.9() : 

By plugging p' into Ea. (|2.3|) . we obtain the new expression for the probability /' of finding a 
quanton in the selected output beam: 

I' = l\^ + T.T. e^^^^-^^)/.,<X.|x,>) . (2.11) 
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If we agree that the visibihty V should be fully determined by the intensity of the output beam 
/', we require that it should be defined in such a way that, for any choice of the environments 
states Ixi >! ^ It is easy to convince oneself that the standard visibility V fulfills this 
requirement for two-beams, while it does not for a larger number of beams. Indeed, for two 
beams, y < ^ is a direct consequence of Ea. H2.5|) . Things are different already with three 
beams. Consider for example the three-beam state, described by the following density matrix 
P 

/ 1 -A A \ 



1 

P=3 



V 



-A 1 -A . (2.12) 
A -A 1 y 

It can be checked that p is positive definite if < A < 1. A direct computation of the visibility 
V, for A > 0, gives the result: 

V = . (2.13) 



2 + X 

Suppose now that the interaction with the environment is such that the environment's states in 
Eq. (|2.9j) satisfy the conditions: |xi >= \X2 > and < Xi\X3 >=< X2\X3 >= 0- This condition 
is typically realized if the environment interacts only with the third beam, as it happens, for 
example, if one scatters light off the third beam only. This is precisely the type of situation 
that is realized, in a four beam context, in the experiment of Ref.[71. With this choice for the 
states Ixi >i the density matrix p' in Ea. (|2.1U() becomes: 

/ 1 -A \ 



V 



1 

-A 




(2.14) 



It can be verified that the new value of the visibility V' is: 

4 



v 



-A 



(2.15) 



We see that, for 1/4 < A < 1, V' > V. We believe that these considerations lead one to 
abandon V as a good measure of the visibility, in the multibeam case, and to search for a 
different definition. 

Thus we need multibeam generalizations of the above definitions for the visibility and the 
predictability. Of course, this is a matter of choice, but it is clear that the choices for the 
definitions of the two quantities are tied to each other, if they are eventually to satisfy an 
inequality like Ea. l|2.8jl . Indeed a simple reasoning provides us with a possible answer. One 
observes that, for any number of beams, it is still true that Trp^ < 1. Upon expanding the 
trace, one can rewrite this condition as: 



Ecf + EE 



< 1 



(2.16) 



One observes now that the first sum depends only on the populations Q of the beams, which 
should determine the predictability, while the second sum depends only on the non diagonal 
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elements of p, which are the ones that appear in the expression of the intensity / of the output 
beam, Ea. (|2.3() . and thus determine the features of the interference pattern. Ea. H2.16() suggests 
that we define the generahzed visibihty V as: 

^' = CEEl^^.f ' (2-17) 

where C is a constant, chosen such that the range of values of V is the interval [0, 1]. One 
finds C = n/(n — 1), and so we get: 



V 



n 



lEEl^^.P' (2-18) 



n - ■ ... 



which is the choice made in [Sj. It is clear that this definition of V satisfies the above require- 
ment, that any interaction with the environment should make V decrease, because, according 
to Ea. H2.9|) . the moduh can never get larg result of the interaction with the envi- 

ronment. Moreover, we see that for two beams V = 2|Ii2|, which coincides with Ea. H2.5() . and 
soV = V. It is easy to check that V can be expressed also as a rms average, over all possible 
values of the phases (pi, of the deviation of the intensity I of the output beam from its mean 
value: 

V = \/^<(A/)2 >] . (2.19) 

Here the, bracket < denotes an average with respect to the phases (pi and A/ = /— < / 

One proceeds in a similar manner with the generalized predictability P. Ea. ()2.16|l suggests 
that we define P as: 

P^ = AY,Ci+B, (2.20) 

i 

where the constants A and B should be chosen such that the range of values of P^ coincides 
with the interval [0,1]. It is easy to convince oneself that this requirement uniquely fixes 
A = n/{n — 1), B = — 1/n, and so we obtain: 



P 



\ 




(2.21) 



which is the choice of ^Q^. It is easy to check that this expression coincides with V, Ea. H2.6|l . 
when n = 2. One may observe that this definition enjoys the following nice features: 

i) P reaches its maximum value if and only if either one of the populations C,i is equal to one, 
and the others are zero, which corresponds to full predictability of the path; 

ii) P reaches its minimum if and only if all the populations are equal to each other, which 
means total absence of predictability; 

ii) P and P^ are strictly convex functions. This means that, for any choice of two sets of 
populations = (Ci; • • • > O) and C" = (Ci i • • • > C) ^.nd for any A G [0, 1] one has: 

P(AC' + (1 - A)C") < AP(C') + (1 - A)P(C") , (2.22) 
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where the equahty sign holds if and only if the vectors C' and C," coincide. A similar equation 
holds for . This is an important property, because it means that the predictability (or 
its square) of any convex combination of states is never larger than the convex sum of the 
corresponding predictabilities (or their squares). 

One can check now that and V"^ satisfy an inequality analogous to Eq.((THJ: 

V"^ + P"^ <l, (2.23) 

where the equal sign holds if and only if the state is pure. This result deserves a number of 
comments: 

1) As in the two beams case, the above inequality provides a testable relation between mea- 
surable quantities, and it would be interesting to verify it. 

2) On the level of interpretations, when saturated, Ea. (|2.23|) can be regarded as a statement 
of wave-particle duality, in analogy with the two-beam relation, Ea. (|2.8() . In fact, since the 
quantity P depends only on the populations Ci, P may be interpreted as a particlelike attribute 
of the quantons. On the other side, since the quantity V depends only on the numbers /jj, 
that determine the interference terms in the expression of /, it is legitimate to regard y as a 
measure of the wavelike attributes of the quanton. 

3) However, the quantity P does not carry the same meaning as the quantity V used in the 
two-beam case, and the name "predictability" given to it in Ref.[S] is not the most appropri- 
ate. Indeed, from the point of view of statistical decision theory the natural definition of 
predictability would not be that in Eo. 1)2. 21(1 . bur rather the following. If one interprets the 
number Q as the probability for a quanton to be in the beam i, and if one decides to bet every 
time on the most populated beam i, the sum X^j^i d represents the probability of loosing the 
bet. Then, it is natural to define the predictability Vn as: 

Vn = l tEC.' (2-24) 

n — 1 ~i 

where the normalization is fixed by the requirement that Vn = 0, if the beams are equally 
populated, and Vn = 1, if any of the populations is equal to one. For n = 2, this definition 
reduces to that used by Greenberger and YaSin, in Ea. (|2.6|) . and in fact it was proposed as a 
generalization of it in Ref. jl3j. It is surely possible to write inequalities involving Vn and V, 
but, as far as we know, none of them is saturated by arbitrary pure states, differently from 
Ea. ()2.23|) . So, one is faced with a situation in which the less intuitive notion of "predictability", 
given by Eo. 1)2. 21(1 . enters in a sharp relation with the visibility, while the most intuitive one, 
given by Eo. 1)2. 24(1 . enters in a relation with the visibility, that is not saturated even for pure 
states. 



3 Higher order inequalities. 

In a multibeam interferometer a new interesting feature is present, which is absent in the 
two-beam case, and puts Ea. (|2.8|) into a new perspective. In fact, Ea. (|2.23|l . that relates the 
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populations of the beams Q to the features of the interference fringes, is only the first of a 
collection of inequalities, that we now discuss. The new inequalities, exactly like Ea. ()2.23|l . 
rest on the first principles of Quantum Mechanics and can be derived along similar lines, by 
considering higher powers of the density matrix p. Indeed, for n beams, one has the following 
n — 1 independent inequalities: 

Trp*" < 1 m = 2, ...,n. (3.1) 
For example, with three beams, if we take m = 3 we obtain: 

0<^C' + 3 5]OEl%l' + 3Ui2W3i+h.c.) < 1 . (3.2) 

i i jj^i 

This inequality, like Ea. ()2.23|) . may be translated in terms of physically measurable quantities, 
although in a more elaborate way. First, we notice that the combination of non-diagonal 
elements of the density matrix, that appears in the last term of the r.h.s. of the above Equation 
represents the third moment of the intensity / of the output beam: 

(/l2/23/31+h.C.) = ^^^^2^. (3.3) 

On the other side, the quantities that appear in the middle terms, are related, as in 

Ea. ()2.5() . to the visibilities Vij of the three interference patterns, that are obtained by letting 
the beams i and j interfere with each other, after intercepting the remaining beam. Therefore, 
we may rewrite Ea. ()3.2|) as: 

0<ECf + ^EC.E^.^ + 3l^^<l, (3.4) 

i i jj^i <t> 

which shows clearly that the novel inequality is a testable relation, to be checked by experiment. 

This example illustrates the general structure of the new higher order inequalities. As the 
number n of beams and the power of m in Ea. ()3.1() increase, higher and higher moments of 
the intensity / will appear. Furthermore, data related to the interference patterns formed by 
all possible subsets of beams that can be sorted out of the n beams, will appear. 

A few comments are in order. On one side, the higher order inequalities are similar to 
Ea. ()2.23|) . in that they are all testable in principle, and become equalities for beams in a pure 
state. On the other side, differently from Ea. H2.23() . they do not exhibit a natural splitting 
of the particlelike quantities Q from the wavelike quantities lij, into two separate, positive 
definite terms. 

The existence of this sequence of inequalities suggests that, from the point of view of 
complementarity, the two-beam and the multibeam case are different. For two-beams, the 
basic properties of the density matrix are completely expressed in terms of a single duality 
relation, like Ea. H2.8|) . In the multibeam case, a whole sequence of independent inequalities is 
needed, if one is to fully express the basic properties of the density matrix. Except for the first 
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one, none of these inequalities seems to be related in any simple way to the intuitive concept 
of wave-particle duality. It seems than that the lowest-order inequality, Ea. (|2.23|) . still carries 
an idea of wave-particle duality, but only at the cost of averaging out the effects related to 
higher order moments. 

4 Which-way detection. 

The notion of predictability, introduced in Sec. II, does not express any real knowledge of the 
path followed by individual quantons, but at most our a-priori ability of predicting it. A more 
interesting situation arises if the experimenter actually tries to gain which-way information 
on individual quantons, by letting them interact with a detector, placed in the region where 
the beams are still spatially separated. The analysis proceeds assuming that the detector also 
can be treated as a quantum system, and that the particle-detector interaction is described by 
some unitary process. A detector can be considered as a part of the environment, whose state 
and whose interaction with the beams can, to some extent, be controlled by the experimenter. 
If we let Ixo > be the initial state of the detector (which we assume to be pure, for simplicity), 
the interaction with the particle will give rise to an entangled density matrix pb&o of the 
form considered earlier, in Ea. H2.9|l . This time, however, we interpret the states \xi > as n 
normalized (but not necessarily orthogonal !) states of the which-way detectors. The existence 
of a correlation between the detector state \xi > and the beam {ipi >, in Ea. ()2.9() . is at the 
basis of the detector's ability to store which-way information. We observed earlier that the very 
interaction of the quantons with the detector, causes, as a rule, a decrease in the visibility. 
According to the intuitive idea of the wave-particle duality, one would like to explain this 
decrease of the visibility as a consequence of the fact that one is trying to gain which-way 
information on the quantons. In order to see if this is the case, we need read out the which- 
way information stored in the detector. We thus consider the final detector state pD, obtained 
by taking a trace of Ea. ()2.9|) over the particle's degrees of freedom: 



As we see, is a mixture of the n final states |xi >) corresponding to the n possible paths, 
weighted by the fraction Q of quantons taking the respective path. Thus the problem of 
determining the trajectory of the particle reduces to the following one: after the passage of 
each particle, is there a way to decide in which of the n states \xi > the detector was left? 
If the states \xi > are orthogonal to each other, the answer is obviously yes. If, however, 
the states \xi > are not orthogonal to each other, there is no way to unambiguously infer 
the path: whichever detector observable W one picks, there will be at least one eigenvector 
of W, having a non-zero projection onto more than one state \xi >■ Therefore, when the 
corresponding eigenvalue is obtained as the result of a measurement, no unique detector-state 
can be inferred, and only probabilistic judgments can be made. Under such circumstances, 
the best the experimenter can do is to select the observable that provides as much information 




(4.1) 
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as possible, on the average, namely after many repetitions of the experiment. Of course, this 
presupposes the choice of a definite criterion to measure the average amount of which-way 
information delivered by a certain observable W. 

Let us see in detail how this is done. Consider an observable W, and let 11^ the projector 
onto the subspace of the detector's Hilbert space Tin, associated with the eigenvalue w^. The 
a-priori probability of getting the result is: 

= Ttd (n^ Pd) = E P^,^ > (4.2) 

i 

where Tr/) denotes a trace over the detector's Hilbert space Tin and Pi^ = \ < x^\^^J.\x^ > P. 
The quantity Q Pj^ coincides with the probability of getting the value Wf^, when all the beams, 
except the i-th one, are intercepted before reaching the detector, and indeed this provides us 
a way to measure the numbers Q Pi^. When the interferometer is operated with n-beams, one 
may interpret the normalized probabilities Qi^: 

Q^, = ^ (4.3) 

as the a-posteriori relative probability, for a particle to be in the i-ih. beam, provided that the 
measurement of W gave the outcome w^. 

On the other side, if W is measured after the passage of each quanton, one can sort the 
quantons in the output beam into distinct subensembles, according to the result of the 
measurement. The subensembles of quantons are described by density matrices of the 
form: 

Pit,) = — TrD (n^ Pb&e) := P{i,)iMi >< V'il , (4.4) 
Pi" ij 

where we defined: 

PMij = ~ < Xj\^^l\x^ > Pij ■ (4.5) 
Pfj. 

We see that the a posteriori probabilities Qj^ coincide with the diagonal elements of the 
density matrices P{f_i)ijj and thus represent also the populations of the beams, for the sorted 
subensembles of quantons. 

Let us consider now the case of two beams. For each outcome tf^, one can consider the 
predictability V^iW) and the visibility V^{W), associated with the corresponding subensemble 
of quantons: 

'P/.iW) = 1/9(^)11 - P(^)22l = IQlM - Q2mI , (4.6) 

V,{W) = 2\p^^)u\ ■ (4.7) 

Notice that both quantities depend, of course, on the observable W. It is clear that an 
inequality like Ea. H2.6() holds for each subensemble, separately: 

VliW) + Vf,iW)<l. (4.8) 

The equality sign holds if and only if the subensemble is a pure state, which is surely the 
case if the beams and the detector are separately prepared in pure states, before they interact. 
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When the eigenvalue is observed, it is natural to define the average amount of which- way 
knowledge delivered by W as the predictability Vf_i(W) of the corresponding subensemble of 
quantons. In order to measure the overall ability of the observable W to discriminate the 
paths, one defines a quantity IC{W) ^, which is some average of the partial predictabilities 
"PfiiW). The procedure implicitly adopted by Englert in is to define IC(W) as the weighted 
average of the numbers V^{W), with weights provided by the a priori probabilities p^: 

IC{W) = Y,p,V,{W). (4.9) 

One can introduce also the "erasure visibility" ^1], relative to W, as the weighted average of 
the partial visibilities: 

V{W) = Y,p^V,{W). (4.10) 

For any W, these quantities can be shown to satisfy the following inequality, that is a direct 
consequence of Eq. (|4.8I) : 

)C^{W)+V^{W)<1. (4.11) 



Moreover, one can prove that: 



< IC^{W) , (4.12) 



which gives expression to the intuitive idea that any observable W, that we decide to measure, 
provides us with a better knowledge of the path, than that available on the basis of a mere a 
priori judgement. One has also the other inequality 

V2 < V^{W) . (4.13) 

For the proofs of these inequalities, we address the reader to Ref.(jlj), where they are derived 
in a number of independent ways. In the so-called which-way sorting schemes, it is natural to 
select the observable W such as to maximize IC{W), and one then defines the distinguishability 
P of the paths as the maximum value of IC{W): 

V = max{/C(P^)} . (4.14) 

It is easy to see that Eas. (|4.11() . (|4.13() and (|4.14l) together imply the following inequality, 
analogous to Ea. H2.8|) . first derived by Englert in Ref.[n]: 

p2 + < 1 . (4.15) 

Thus, given the visibility V, there is an upper bound for the distinguishability, set by the above 
relation. But Englert in fact proves much more than this: he shows that Ea. H4.15() becomes 
an identity, when both the beams and the detector are in a pure state. In our opinion, this 
fact is essential to justify the interpretation of Ea. (|4.15|) as a statement of the complementary 

^Indeed, Englert considers the "Ukehhood Cw for guessing the way right. In our notation, Cw = (1 + 
!C{W))/2. 
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character of the wave and particle attributes of a quanton. In fact, this impUes that, when 
the beam of quantons and the detector are as noiseless as they can possibly be in Quantum 
Mechanics, namely when they are in pure states, an increase in any of the two terms is neces- 
sarily accompanied by an exactly quantifiable corresponding decrease of the other. 
A possible generalization of the above considerations, to the multibeam case, is as follows jS]. 
One sorts again the quantons, into subensembles, depending on the outcome of the measure- 
ment of W. For each outcome w^, one uses the generalized predictability P in Ea. l|2.21j) . and 
the generalized visibility V in Ea. (|2.18|) . to define the "conditioned which- way knowledge" 



and the "partial erasure visibility" Vfj,{W): 



\ 




(4.16) 



V * i^* 

In view of Eq. (|2.23jl , they satisfy an inequality analogous to Eq. (|4.8|1 : 

Kl{W) + V^{W)<l. (4.18) 

Again, as in the two beam case, the equality sign holds if the subensembles are pure. The 
author of Ref.,9: considers now two different definitions for the "which- way knowledge" and 
the "erasure visibility", associated to W, as a whole. The first one is closer to Ea. ()4.9() : 

K{W):=Y,p^K^{W) , V{W):=Y,p^V^{W). (4.19) 

The second one, inspired by the work of Brukner and Zeilinger jl5j . is-^-: 

K\W):=J2Pf^KliW) , V\W):=Y,p,V;^iW). (4.20) 

The quantities introduced above, are related by the following chains of inequalities, the proofs 
of which can be found in j^]: 

V < V{W) < V{W) , P < K{W) < K{W) . (4.21) 

These inequalities show that K(W) and y(VF) provide more efficient measures for the average 
which-way information, and for the erasure visibility, respectively. However, the author of 
Ref.[S] observes that the quantities K{W) and F(VF) are preferable to K{W), and V{W), 
respectively, because they are the ones that reduce, for n = 2, to the definitions used in the 
two-beam case. We would like to point out that, since K^{W) and y^(VF) are essentially 
variances of the diagonal and non-diagonal elements, respectively, of the density matrices for 

"'"We use here a notation different from that of Ref.0. Our K^{W) and V'"^(V[^) correspond, respectively, to 
n/{n — 1)Ikw and n/{n — l)Ivw, in 
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the subensembles of quantons, it appears more natural, from a statistical point of view, to 
combine them in quadrature, as done in Ea. ()4.2U|) . This suggests that one should adopt the 
definition with the quadrature also in the two-beam case. 

By taking the suprema of all the quantities defined above, over all possible observables W, 
one can define a set of four quantities, that characterize the state p of the beams. For example, 
upon taking the maxima of K{W) and K{W), we end up with two possible definitions for the 
which-way distinguishability, D and D, respectively: 

D = mayi{K{W)} , D = max{i^(T^)} . (4.22) 

Similarly, by taking the suprema of ^(VF) and y(W^), we obtain two definitions of the so-called 
"coherence" of the beams 

C = sup{V{W)} , C = sup{V{W)} . (4.23) 
w w 

(The reader may found in Ref.,l! an explanation of why one has maxima, in the definition of 
distinguishability, and only suprema in that of coherence.) 

The quantities introduced above, satisfy a set of inequalities, that all follow from the 
chains of inequalities Eas. (|4.21|) . and from the following inequality, that can be obtained from 
Ea. (|4.18|l . on averaging over all possible outcomes w^: 

K^{W) + V^{W) <1 . (4.24) 

It is clear that this inequality is saturated, regardless of the observable W, when the state of 
the combined detector-beam system is pure. This is an immediate consequence of Ea. (|4.18|) . 

One of the central results of Ref . [H] is the following inequality, generalizing Eq. 1)4. 15(1 : 

D"^ + V"^ <l. (4.25) 



Since D > D, this also implies: 



+ V-^ < 1 . (4.26) 



Thus we see that also in the multibeam case, the visibility V sets an upper limit for the amount 
of which-way information, irrespective of how one measures it, via D or D. In Ref.[n| it is 
suggested that the above two inequalities provide multibeam generalizations of the two-beam 
wave-particle duality relation Ea. H4.15() . 

Even if Ea. ()4.26|) and Eg. 1)4. 25(1 represent correct inequalities, that can be tested in an ex- 
periment, in our opinion, their interpretation as an expression of wave-particle duality appears 
disputable. The root of the problem is that the above inequalities, differently from the two 
beam case, cannot be saturated, in general, even if the beams and the detector are prepared 
in pure states (in Appendix I, we actually prove that Ea. H4.26() . for example, can be saturated 
only if D = P, which means that the detector does not provide any information). Therefore, 
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one may conceive the possibility of designing two which- way detectors Di and D2, such that 
Vi > while, at the same time, Di > D2. This possibility, which conflicts with the intuitive 
idea of complementarity, actually occurs, as we anticipated in Ref.[Tn|, and as we report in the 
next Section. 



5 A three-beam example. 

The example discussed in Ref.^Uj, was based on a three beam interferometer with equally 
populated beams, described by the pure state: 

P=\Y. IV'i ><i^j\ ■ (5-1) 

For the sake of simplicity, it was assumed there that the detector's Hilbert space was a two- 
dimensional space 7i2- Its rays were described via the Bloch parametrization, such that: 

1 + n- a , , , , 
^ = |X><X|, (5.2) 

where n is a unit three- vector and a = (ux, cry,az) is any representation of the Pauli matrices. 
We denoted by |n >< n| the ray corresponding to the vector h. We required that the directions 
h^,h-,hQ, associated with the states \xi >, were coplanar, and such that n+ and n_ both 
formed an angle 9 with no We imagined that 9 could be varied at will, by acting on the 
detector, and in Ref.^O] we obtained the following expressions for the visibility V and the 
distinguishability D, as functions of 9: 



Vie) = ^l±£25i±f25!f , (5.3) 

D{9) = ^ sine for < < 2/3^ , (5.4) 
V3 

2 f 9\ 

D{9) = -sm^{-\ for 2/37r<6'<7r. (5.5) 

The values of V and D are plotted in the figure. By looking at it, one realizes that something 
unexpected happens: while in the interval < < 7r/2, V decreases and D increases, as 
expected from the wave-particle duality, we see that in the interval '/r/2 < < vr, V and D 
decrease and increase simultaneously! We see that if we pick two values 9i and 6*2 in this 
region, we obtain two which-way detectors, that precisely realize the situation described at the 
end of the previous Section. 



The analysis of E,ef.|inj. that we have summarized here, is not realistic though, because of 
the simplifying assumption of a detector with a two-dimensional Hilbert space of states. Even 
assuming that the detector's final states \xi > span a two-dimensional subspace 7i2, still one has 
to take into account that the full Hilbert space TCd oi a realistic device is infinite-dimensional. 
Now, it is known from the theory of quantum detection |12[|17j that the optimum discrimination 
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Figure 1: Plots of the quantities D (solid line), V (dotted line), and + V'^ (dashed line), as 
functions of in a three beam situation. 

among an assigned set of quantum states, is not always achieved by an observable that leaves 
invariant the subspace spanned by them. However, the value of D quoted above corresponds to 
maximizing the which-way knowledge over the restricted set of detector's observables W , that 
leave invariant the subspace 7i2- Then, in order to complete the proof, we need to show that no 
observable in TCd can perform better than the one determined in R,ef.|l()j. by considering only 
operators that live in Ti.2- Filling this gap, is by no means an easy job, because it is a matter of 
solving an optimization problem in an infinite-dimensional Hilbert space. There is no general 
strategy for solving this sort of problems, and we can rely only on few known general results 
^lElEl- The interested reader can find the lengthy procedure to compute D in Appendix 
II. Here, we content ourselves with sketching the method followed, and presenting the results. 

For the sake of definiteness, let us agree to use K{W) as our measure of the which way- 
information. At the end of this Section, we shall discuss what changes if one instead uses 
K(W). The determination of the optimal observable Wopt is facilitated by the observation 
that, even when Ti.£) is infinite-dimensional, the problem can be formulated entirely in the 
subspace 7i2, as we now explain. One observes that the probabilities Pj^ that enter in the 
definition of K(W) can be written also as: 



where H is the orthogonal projector onto 7i2, and = HH^H is a positive (hermitian) 
operator on the subspace 7i2- Thus we see that the operators contain all the information 
we need, about W, in order to compute the which-way knowledge. It is to be noticed that A^ 



Pif, =< Xil^filxi >=< Xj|nn^n|xi >=< xilWxi > 



(5.6) 
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are not projection operators, in general. However, they must provide a decomposition of the 
identity onto 7^2, since: 

5] = ^ n n^n = n(^ n^)n = n. (5.7) 

Such a collection of operators on 7i2, provides an example of what is known in Mathematics as a 
Positive Operator Valued Measure (POVM in short). Notice though that, while any hermitian 
operator in Tin gives rise, by projection, to a POVM in 0.2, the converse may not be true. § 
Our strategy to determine VFopt is then to search first for the optimal POVM Aopt in TI2 (the 
notion of which way knowledge is obviously defined for an arbitrary POVM, as well), and to 
check at the end if Aopt can be realized by projecting onto TC2 an operator W in TCd, as in 
Ea. ()5.6() . If this is the case, W is guaranteed to be optimal, and we can say that D = K{Aopt). 
The determination of Aopt is facilitated by a general theorem |18j . that states that for any 
measure of the which-way knowledge that is a weighted average of a convex function, the 
optimal POVM consists of rank-one operators. This is the case for the which-way knowledge 
K, which is a weighted average of the predictability P, which indeed is a convex function. The 
A^ being rank-one operators, we are ensured that there exist non-negative numbers 2a^ < 1 
and unit vectors such that: 

= 2a^|m^ >< m^l = 0^,(1 + rh^-a) . (5.8) 

The condition for a POVM, Ea. H5.7p is equivalent to the following conditions, for the numbers 
and the vectors rh^: 

^ = 1 , ^ a^m^ = . (5.9) 

The interested reader may find in Appendix II how the optimal POVM can be determined. 
Here we just report the result: for all values of 9, Aopt turns out to have only two non vanishing 
elements, A±, such that: 

for < 6* < 27r/3 , (5.10) 

for 27r/3 < < vr . (5.11) 

It is clear that the operators A± coincide with the projectors found in Ref. jlUj. showing that 
it was indeed sufficient to carry out the optimization procedure in 7^2- 

It should be appreciated that this coincidence is by no means trivial, and strictly depends 
on the choice of K{W) as a measure of which-way knowledge. For example, for 9 = 27r/3, 
it is known |121 117j . that, with either Shannon's entropy or Bayes' cost function as measures 
of information, the optimal POVM actually consists of three elements, and thus it is not 
associated with an operator in 7i2- 

^In effect, this problem arises only if TCd is finite dimensional. If Tio is infinite dimensional, all POVM's are 
acceptable, because a general theorem due to Neumark 1191 ensures that all POVM's of any Hilbert space, can 
be realized as projections of self-adjoint operators from a larger Ifilbert space. 
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Then, our observation that the inequahty Ea. H2.23|) fails to carry the physical picture 
associated with the idea of complementarity is now fully demonstrated. We have checked that 
a similar conclusion can be drawn if, rather than K, one uses the alternative definition of 
distinguishability D provided by Ea. (|4.22|) . In fact, it turns out that the optimal POVM for 
K coincides with the one found earlier, in the interval < ^ < 2/3 vr, and so D = D. The 
proof of this can be found in Appendix II. 

6 Conclusions 

The intuitive concept of Complementarity has found, in the case of two-beams interference 
experiments, a satisfactory, fully quantum mechanical formulation as interferometric duality. 
In this paper, we critically analyzed the difficulties encountered in the attempt of generalizing 
this concept to multibeam experiments, and discussed the shortcomings that are present, in 
our opinion, in recent proposals. It seems to us fair to say that interferometric duality has not 
yet found a proper formulation, in the multibeam case. To justify this conclusion, let us recall 
the different points we have elaborated in the paper. 

In the two-beam case, general quantum mechanical requirements on the density matrix im- 
ply the Greenberger-YaSin inequality, that, when saturated, expresses interferometric duality. 
This inequality has been generalized to the multibeam case leading to a formal definition 
of interferometric duality for more than two beams. The price payed is that the corresponding 
generalized concept of predictability has lost the intuitive connection with minimizing the error 
in guessing the way right. The traditional concept of predictability may enter, together with 
the generalized visibility, in an inequality that is not saturated, and then cannot convey the 
idea of complementarity, which requires that a better visibility is necessarily related to a loss 
in information. 

We have shown that general requirements of quantum mechanics imply new inequalities, 
that are not present in the two beam case. These inequalities are again experimentally testable. 
They deserve further study but, at the present, they do not seem to exhibit a direct relation 
with the idea of complementarity. 

Interferometric duality may be fully analyzed only in the presence of which-way detectors. 
In the two beam case, Englert has shown that the visibility enters, with the distinguishability, 
into an inequality, that is saturated for pure states. As maximizing the distinguishability, 
minimizes the error in guessing the way right by performing a measurement, this relation fully 
expresses interferometric duality. In deriving an analogous inequality for the multibeam case, 
Diirr has introduced two alternative notions of distinguishability. However, we have shown 
that this inequality is never saturated, apart from trivial cases. Then, a pure inequality may 
be consistent with a situation in which an increase (decrease) in visibility goes together with 
an increase (decrease) in distinguishability, contrary to the intuitive idea of interferometric 
duality. We have given a full proof that this possibility actually occurs in a realistic example. 
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The inequalities proposed by Diirr, in terms of generalized visibility and distiguishability, are 
then correct quantum mechanical relations, testable in principle, but they fail to convey the 
idea of interferometric duality. 

It is seems then fair to conclude that interference duality in multibeam experiments has 
not yet been properly formulated. We leave the problem open, but we notice it is by no means 
necessary that quantum mechanics should provide us with an exact formulation of this concept 
in the multibeam case. May be, one should content him(her)self with its formulation in the 
two beam case, where the semiclassical intuitive idea of complementarity was first introduced. 
May be, Quantum Mechanics provides us just with the values of observable quantities, and 
experimentally testable inequalities. The analysis we have performed may hint in this direction, 
but further investigation is required. 
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8 Appendix I 

In this Appendix, we prove the following result: for any number n > 2 of beams in a pure state 
p, and any detector in a pure initial state, the inequality Ea. (|4.26() is satisfied as an equality 
if and only if D = P, namely when the detector provides no information at all. The proof 
consists in showing that the equal sign in Ea. (|4.26jl holds only if the detector states \xi > are 
proportional to each other, which obviously implies D = P. Consider the optimal operator 
Wopt such that K{Wopt) = D (we assume that such an operator exists), and let V(Wopt) be 
the corresponding erasure visibility. It follows then from Eas. (|4.21|) and Eas. (|4.24|) that: 

+ V^<D^+ V\Wopt) = K\Wopt) + V\Wopt) < K\Wopt) + V\Wopt) = 1 • (8.1) 

We see that a necessary condition to have + V'^ = lis that: 

V = ViWopt) . (8.2) 

In what follows, we shall not consider the trivial case V = 0, and we shall suppose that 
V > 0. In order to study Ea. (|8.2|) . we take advantage of the fact that, K{W) being convex, 
the spectrum of Wopt can be taken to be non degenerate ^H]- If we let \Wf^ > the eigenvectors 
of Wopt, with non- vanishing projection onto some of the states \xi >, by using the expressions 



19 



Ea. ()4.17|) for the partial visibilities, we can write: 



n n 



= jimi<^Ml^^il^M >lyimi <WvVpvq\Wv >P , (8.3) 

where 

Pij ■= Pij \Xi >< Xj\ ■ (8-4) 
Now, the Cauchy-Schwarz inequality for real vectors implies that: 



I< '^/^IPijl'^fJ. >P, I < W^\ppq\w^ >|2 > I< W^\pij\Wf, >|-| < W„\pij\w„ > I . 

(8.5) 

Upon using this relation into Ea. H8.3() . we obtain: 

(8.6) 

Obviously: 

^ I < > I > I X! < > I • (8.7) 



Then, Ea. H8.6p becomes: 



n ■ / ■ 



Clearly, V'^iWopt) becomes equal to V, if and only if all the inequalities involved in the deriva- 
tion of Eq. (|8.8|) become equalities. Notice that the case n = 2 is special, for then the Cauchy- 
Schwarz inequalities Eq . (18.51) are necessarily equalities, because the sums in Eq. (|8.5|) contain 
just one term. However, for n > 2, we have the equal sign if an only if there exist positive 
constants such that: 

I < Wf,\pij\w^ > I = Cj.| < Wi^\pij\wu > I , y j . (8.9) 

Since < Wf^lpijlw^ >=< w^\xi >< Xjl^^fi > Pij-: and we assume pij / 0, the above condition is 
equivalent to 

c^_i I < w^\xi >< Xj\WfM > I = c,, I < Wulxi >< Xj\wv > I , i ^ j ■ (8.10) 

On the other side, the set of inequalities Eq. (|8.7j) become equalities if and only, for all j ^ i, the 
phases of the complex numbers < Wf^lpijlw^ >, and then of the numbers < w^\xi >< Xjl^M 
do not depend on fi: 

arg(< w^lxi >< Xj\Wfi >) = % • (8-11) 
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Now, for n > 2 and V > 0, Ea. (|8.1U|) implies that the matrix elements < Wf^\xi > are all 
different from zero. To see it, we separate the states \xi > into two subsets, A and B. A 
contains the detector states which are orthogonal to some of the eigenstates \ w^ >. B contains 
the remaining states. We can prove that, for V > A must be empty. This is done in two 
steps: first we prove that if A contains some detector states, then it contains all of them. In 
the second step, we show that the elements of A are orthogonal to each other. By combining 
the two facts, it follows that A must be empty, because otherwise all detector states would be 
orthogonal to each other, and then, by taking a W that has the detector states as eigenvectors, 
we would achieve D = 1 and l^(Wopt) = 0, which is not possible, because we assumed that 
y > 0. So, let us show first that if A contains some detector states, it contains all. In 
fact, let Ixi > be one of its elements. Then there exists a value of /i, say /j, = 2, such that 

< tt;2|xi >= 0. On the other side, since the vectors \Wf^ > form a basis for the vectors \xi >, 
there must be some eigenvector, say \wi >, such that < li^ilxi >7^ 0. Suppose now that B 
contains an element, say \xn >, and consider Ea. (|8.10() . for i = 1, j = n, /i = 2 and v = 1: 
C2 I < W2\xi X Xn\w2 > | = c„ | < tt^ilxi >< Xn\wi > \. It is clear that the l.h.s. vanishes, 
while the r.h.s. does not. It follows that there cannot be such a \xn >■ Then, if A contains 
just one detector state, it contains all. 

Now we can turn to the second step. In order to prove that all elements of A are orthogonal 
to each other, consider for example Eg. 1)8. 10(1 for fj, = 2 and i = 1: they imply that, for 
any j ^ 1 and any u, the numbers | < Wiy\xi >< Xjl'Wu > \ niust vanish. But this implies 

< Wu\xi X Xjl'^u >= 0. Summing over all values of v, we obtain: 

= H < W'^lxi >< Xj\wu >=< Xjlxi > ■ (8.12) 

V 

So, Ixi > is orthogonal to all other detector states \xi >■ The same reasoning applies to all 
elements of A, and thus we conclude that all detector states are orthogonal to each other. 

Having proved that all matrix elements < Xi\Wf_i > are different from zero, we can now show 
that the detector's states \xi > are indeed proportional to each other. Since n > 2, for any 
i 7^ j, we can find a k distinct from both i and j. Consider now Eo. 1)8.10(1 for the couples 
i, k and j, k, and divide the first by the second. This is legitimate, because all inner products 

< Wf^lxi > are different from zero. We get: 

1 < Wf,\xi > I _ I < WulXi > I 



This is the same as: 



< Wf,\xj > I I < w^lXj > 

< w^lxi > I _ I < Wf^lxj > 



■ (8.13) 



V i / j . (8.14) 



I < Wy\Xi > I I < ■Wy\Xj > I 

Since J2fi I < Wf_i\xi > P = 1 for all i, it is easy to verify that the above equations imply: 

I < w^lxi > I = I < Wf,\xj > I • (8.15) 
To proceed, we make use now of Eg. (|8. 11(1 . If we set a^i = arg < w^\xi >, Eg. (|8.11|) implies: 

Oifii - a^j = 6ij , (8.16) 
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which obviously means that, for fixed i and j and variable //, the phases of the complex numbers 
< Wf^lXi > and < w^\xj > differ by the overall phase 6ij, and this implies: 

\Xi >=e''^^Xj > ■ (8.17) 

Since all detector states differ by a phase, it obviously follows that the detector provides no 
information at all, and thus D = P. 



9 Appendix II 

In this Appendix, we determine the rank-one POVM that maximizes the which-way knowledge, 
for the three beam interferometer considered in Sec.V. The procedure is different, depending 
on whether we choose to measure the which-way knowledge by means of -ftT or i^. We consider 
first because it is the simplest case. We can prove then that, for any number of beams with 
equal populations and any choice of the detector states \xi > in W2, the POVM A that 
maximizes K can be taken to have only two non vanishing elements, A = {^1, ^2}- The proof 
is as follows. First, we notice that, for any rank-one POVM consisting of only two elements, 
the conditions for a POVM, Ea. ()5.9|) . imply: 

1 , , 

ai = «2 = - , mi + m2 = 0. (9.1) 

Thus, all rank-one POVM with two elements are characterized by a pair of unit vectors m^, 
that are opposite to each other. Such a POVM clearly coincides with the Projector Valued 
Measure (PVM) associated with the hermitian operator rhi-a in 7^2- We let A the optimal 
PVM, that can be obtained by considering all possible directions for rhi. We can show that 
such an A represents the optimal POVM. To see this, we prove that the which way-knowledge 
K{A) delivered by A is not less than that delivered by any other POVM C. By virtue of the 
theorem proved in Ref. jl8j. it is sufficient to consider POVM's C made of rank-one operators. 
In order to evaluate K{C), it is convenient to rewrite the quantities p^K^, for any element 
C^, = 2aP (1 + mP ■ a) of C, as 



n — 1 \ n 



^ {-^[1 + (mf- E C.n.)^] + E [1 + i^f^-n.?] + 2mf ). E C. (c. - ^) n.j 

I i i i ) 

(9.2) 

We observe now that, for equally populated beams, Ci = l/'^; the last sum in the above 

equation vanishes, and the expression for Pfj,K^ becomes invariant under the exchange of 

(c) 

with —mil . Consider now the POVM B, such that: 

Bl^ = lc^, B- = ^a(^P{l-mP-a) (9.3) 



1/2 
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Of course, p^p^^ = p^K^/l, while the invariance of p^K^ imphes p\i ■* = p^^^^ K^^\ 
It follows that the average information for B and C are equal to each other, K{B) = K{C). 
Now, for each value of /x, the pair of operators B^/a^^^ = {l±m^i^^ ■a)/2 constitutes by itself 

a POVM, with two elements. Thus, the POVM C can be regarded as a collection of POVM's 

(c) 

with two elements, each taken with a non-negative weight ali . But then K[C), being equal to 
the average of the amounts of information provided by a number of POVM with two elements, 
cannot be larger than the amount of information K{A) delivered by the best POVM with two 
elements. Thus we have shown that K{C) = K{B) < K{A), which shows that A is the optimal 
POVM. 

It remains to find A for the example considered in Sec.V, but this is easy. If we let /3 and 7 the 
polar angles that identify the vector rhi, one finds for the square of the which- way information 
the following expression: 

4 



cos /? sin - + 3 sin /3 cos 7 cos 



sin' ( - ) . (9.4) 



For all values of 6, the which-way information is maximum if cos 7 = ±1, i.e. if the vector rhi 
lies in the same plane as the vectors hi. As for the optimal value of (3, it depends on 6. For 
< 6* < 27r/3, the best choice is (3 = ±7r/2, and one gets the PVM in Eq. (|5.10|) . with gives the 
path distinguishability D given in Eq. (|5.4|) . For larger values of 6, one has /3 = and then the 
optimal PVM is that of Eq. (|5TT|) . with D given by Eq. (|5.5)) . 

We turn now to the case when the which-way information is measured by means of K. Since 
the square of the predictability is a convex function, we are ensured by the general theorem 
proved in jl8j that the optimal POVM is made of rank-one operators, of the form (|5.8I) . We 
split the computation of the optimal POVM in two steps. First, we prove a lemma, which 
actually holds for any measure of the which-way information F, which is a weighted average 
of a convex function of the a-posteriori probabilities Qi^. 

Lemma: consider an interferometer with n beams, and arbitrary populations Q. Let the 
detector states \xi > be in TC2, and have coplanar vectors hi. Then, the optimal POVM is 
necessarily such that all the vectors in Eq. (|5.8|) lie in the same plane containing the vectors 
hi. 

The proof of the lemma is as follows. Let B be an optimal POVM. Suppose that some of the 
vectors mjt do not belong to the plane containing the vectors hi, which we assume to be the 

xz plane. We show below how to construct a new POVM A providing not less information than 

(A) 

B, and such that the vectors all belong to the xz plane. The first step in the construction 
of A consists in symmetrizing B with respect to the xz plane. The symmetrization is done 
by replacing each element i?^ of B, not lying in the xz plane, by the pair {B'^,B'^), where 
B'^ = Bp/2, and B'/^ has the same weight as B'^, while its vector rh!j^^" is the symmetric of 
m\i with respect to the xz plane. It is easy to verify that the symmetrization preserves the 
conditions for a POVM [Eqs. 1)5. 9|) ]. Since all the vectors hi belong by assumption to the xz 
plane, the which way knowledge actually depends only on the projections of the vectors 
in the plane xz. This implies, at is easy to check, that symmetrization with respect to the xz 
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plane does not change the amount of which way knowledge. We assume therefore that B has 
been preliminarily symmetrized in this way. Now we show that we can replace, one after the 
other, each pair of symmetric elements {B'^,B'P by another pair of operators, whose vectors 
lie in the xz plane, without reducing the information provided by the POVM. Consider for 
example the pair {B'^,B'^). We construct the unique pair of unit vectors Uk and v^, lying the 
xz plane, and such that: 

u^ + v^ = 2(mf i + m^^)^ k) , (9.5) 

where i and j are the directions of the x and z axis, respectively. Notice that ^ Vk- Consider 
now the collection of operators obtained by replacing the pair S") with the pair {A'^, A") 
such that: 

yl', = af)(l + n«.a) , A'^ = af ) (1 + a) . (9.6) 

It is clear, in view of Eqs. ()9.5() . that the new collection of operators still forms a resolution of 
the identity, and thus represents a POVM. Equations (|9.5j) also imply: 

pL'^' = Pi^'" = Ml + + )^nf ) = 

= ia.(l + « + <nf) + ia.(l + <nf + <nf) = i(p/,^)' + P^f") , (9.7) 

Now, define A'^ := pif'^' /{2pif^), and A" := pl^^" / {2p^if^ ) , where pi^"* := pi^"*' = plf^". Since 
Pk^^' + pif'^" = 2,p\^\ we have A'^ + A" = 1. It is easy to verify that: 

C^' = C^" = >^.(^' + K(&" , (9.8) 
But then, the convexity of F implies: 

)'F(gf )') +pf )"F(Qf )") = 2pf )F(Qf )') = 
= 2pP)F(A',Q(^)' + A'^Q(^)") < 2pP)[A;F(q1^)') + A'^F(Q(^)")] = 

= p'i^)F(Q'i^))+p(^)"F(Q(^)"). (9.9) 

It follows that the new POVM is no worse than B. By repeating this construction, we can 
obviously eliminate from B all the p pairs of elements not lying in the xz plane, until we get a 
POVM A, which provides not less information than B, whose elements all lie in the xz plane. 
This concludes the proof of the lemma. 

Now we can proceed as follows: we consider the POVM's consisting of two elements only, and 
having its vectors mj parallel to the x axis. By direct evaluation one can check that K(A) 
equals the expression in Ea. (|5.4|) . We can prove that, for < 6* < 27r/3, such an A provides not 

less information than any other POVM, C, consisting of more than two elements. By virtue of 

(c) 

the lemma just proven, we loose no generality if we assume that the all the vectors of C 
lie in the xz plane. Our first move is to symmetrize C with respect to z axis, by introducing a 
POVM B, consisting of pairs of elements {B'^, B^'), having equal weights, and vectors tti'^^ and 
that are symmetric with respect to the z axis: 

B'^ = ^C, , B; = ^ aP (1 - m^a, + m^a,) , (9.10) 
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B provides as much information as C. Indeed, in view of Eq. (??), we find 

<^ = 2Pi7 = 2p(7' , (9.11) 

The invariance of the predictabihty with respect to permutations of its arguments, then ensures 
that K(B) = K{C). Thus, we loose no information if we consider a POVM B, that is symmetric 
with respect to the z axis. Now we describe a procedure of reduction that, apphed to a 
symmetric POVM hke gives rise to another symmetric POVM i?, which contains two 
elements less than B, but nevertheless gives no less information than B. The procedure works 
as follows: we pick at will two pairs of elements of say (B'jy,B'^) and {B'j^_^, B'^_-^^) and 
consider the unique pair of symmetric unit vectors u± = zizu^ i + k such that: 

= (B) ^ (B) ("iv^ "^Sv + «iv-i "^Sv-i) ■ (9-12) 

Consider the symmetric collection 13, obtained from B after replacing the four elements 
(S^,S^,S^_i,5^_J by the pair {B'j^_^, B'^^^^) such that: 

^^-i = («SNf^+«if2i)(l + ^+-^)> = (aivf^+aif2i)(l + n--a) . (9.13) 

B is still a POVM, as it is easy to verify. Moreover, B provides not less information than B, 
as we now show. Indeed, after some algebra, one finds: 

(B) , (B) - I (B) , (B) 9[^N ) m) , (B) 9y^N-\> ' yy-^"^) 
a]v +«iV-l "TV +"7V-1 "iv +"7V-1 

where the function g{x) has the expression: 

3 + x(l + 2cos6l) (l + x)2 + 2(l + 2;cos6')2 + 2(l-x2)sin2 6l 
= 6 + 6 + 2x(l + 2cos0) • 

In view of Eq. ()9.12|) . the r.h.s. of Eq. 1)9.14(1 is of the form 

g{\xi + (1 - A)x2) - \g{xi) - (1 - A) 5(2:2) , (9.16) 

where A = a^^^ /{a^^^ + aj^^i), while xi = m^^^ and X2 = w-jv-i- 1^ may be checked that, for 
all values of 6, such that < ^ < 27r/3, g{x) is concave, for x £ [—1,1], and so the r.h.s. of Eq. 
(|9.16|1 is non-negative for any value of A G [0, 1]. This implies that the r.h.s. of Eq. (|9.14|1 is 
non-negative as well, and so K{B) > K{B). After enough iterations of this procedure, we end 
up with a symmetric POVM consisting of two pairs of elements (Sj,Bj') and {B2,B2). But 
then, the conditions for a POVM, Eqs. ()5.9|) . imply that the quantity between the brackets 
on the r.h.s. of Eq. ()9.12|) vanishes, and so Eq. (|9.12|) gives = 0. This means that the last 
iteration gives rise precisely to the PVM A. By putting everything together, we have shown 
that K{C) = K{B) < K{B) ... < KiA), and this is the required result. 
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